Extracellular mucin detected by deep learning from histopathology images predicts consensus molecular subtypes in colorectal cancer
Colorectal cancers are a heterogeneous group of tumors, from the histomorphological, clinical and molecular points-of-view. In terms of molecular changes, one particularly well-described genomic aberration is microsatellite instability (MSI), affecting approximately 15% of all cancers and is an important factor affecting treatment decisions in colorectal cancer. Attempts have been made to predict MSI cancers using only the histopathology image (Bychkovetal 2018). In fact, experienced pathologists can often identify MSI cases simply be glancing at the stained slide. Mucin is not exclusively found in MSI-high patients, and the prognosis associated with mucinous histology is controversial.
In order to better understand the role of extracellular mucin in colorectal cancer, we created a deep learning classifier to quantify the extracellular mucin-to-tumor area ratio in three independent cohorts (Bern n=517, TCGA n=412 & CPTAC n=85) and investigate the genotype-phenotype correlation of mucin with CMS groups, MSI status and gene expression of mucin-producing genes.
A large retrospective cohort of 517 primary colorectal cancer patients diagnosed at the Institute of Pathology between 2002-2018 and treated at the Insel Hospital Bern (Switzerland)
The Cancer Genome Atlas (TCGA) data (Cancer Genome Atlas 2012) from public repositories at the National Institutes of Health (NIH; USA). 412 patients cases have been selected.
A public dataset The Clinical Proteomic Tumor Analysis Consortium (CPTAC) from the National Cancer Institute (NCI; USA) was also included in this study. The 373 H&E WSIs of 106 colon adenocarcinoma patients were downloaded from The Cancer Imaging Archive (TCIA) (National Cancer Institute Clinical Proteomic Tumor Analysis Consortium 2020).
A novel precise tissue segmentation of histopathology images using deep learning, namely Group Affinity Weakly Supervised segmentation (GAWS) is proposed. It processes one histopathology image and some patches of prior tissue as input with three main steps.
First, we tested the inter-observer agreement of extracellular mucin component, recorded as the percentage of total tumor area covered by extracellular mucin, between two pathologists. The results were excellent [pathologist 1 ICC=0.915 (95%CI: 0.885-0.937) and pathologist 2 ICC=0.923 (95%CI: 0.896-0.943)].
In CMS1 patients, mucinous histology is clearly a favorable factor, whereas in stage II and CMS3 patients, patients experience a much more unfavorable outcome. This may help to add to the current discrepancies found in the literature with regard to the impact of this histological feature and outcome (REF).
Mucinous histology is independent of microsatellite instability(MSI) status. On the one hand, mucin is often a feature of MSI, and a likely consequence of genomic instability affecting mucin-producing genes. On the other, it occurs in non-MSI cases as well. Our work here based on only 2 features, namely MSI status and the presence of extracellular mucin, helps to narrow down identification of CMS groups. Nearly all “mucinous” tumors with MSS are CMS3 or CMS4.
In this study, we show that mucinous histology and moreover, the presence of extracellular mucin in colorectal cancer, can help predict the underlying Consensus Molecular Subgroup(CMS) group, independently of the MSI status. CMS2 cancers rarely show any mucin, a result validated by gene expression analysis of four mucin producing genes - MUC2, MUC4, MUC5AC and MUC5B.
Extracellular mucin is an indicator of a phenotype-genotype correlation and together with MSI status can differentiate between CMS1 and CMS3 cancers. Secondly, extracellular mucin is a feature of poor outcome in patients with stage II cancers and CMS3 tumors, despite the more indolent features associated with a mucin-producing phenotype and should be considered during clinical management.